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Amendments to the Specification 

In the Specification: 

Please cancel the existing sequence listing for the above-identified application, and 
replace it with the substitute sheets appended hereto. Please renumber the remaining pages 
accordingly. A computer readable copy of the substitute sequence listing is forwarded 
herewith. 

At page 1, after the title and prior to the "Background of the Invention" section, please 
insert the following: 

—Cross-reference to Related Application 

This application is a divisional of U.S. Application No. 08/841,636, filed April 30, 
1997 (pending), which is a continuation of International Application No. PCT/FI96/00550, 
filed October 17, 1996, and a continuation-in-part of U.S. Application No. 08/732,181, filed 
October 16, 1996 (abandoned), which claims the benefit of U.S. Provisional Application Nos. 
60/005,335, filed October 17, 1995; 60/007,926, filed December 4, 1995; and 60/020,840, 
filed June 28, 1996. 



-4- 



Miettinen-Oinonen et ah 
Appl. No. To Be Assigned 



Please amend the following paragraphs/sections as follows. 

Please amend the paragraph beginning on page 7, line 21, as follows: 

Figure 1 7 shows amino acid sequence data derived from sequencing the 20K-cellulase 
described in the exemplary material herein. Sequence 429 (SEP ID NO:l) is from the N 
terminus of the protein and the other sequences are from internal tryptic peptides. Sequence 
#430 corresponds to SEP ID NO: 2; sequence #431 corresponds to SEP ED NO: 3; sequence 
#432 corresponds to SEP ID NO: 4; sequence #433 corresponds to SEP ID NO: 5; sequence 
#439 corresponds to SEP ID NO: 6; fr 9 corresponds to SEP ID NO: 7; ft 14 corresponds to 
SEP ID NP: 8; fr 16 corresponds to SEP ID NP: 9; fr 17 corresponds to SEP ID NP: 10; fr 
28 corresponds to SEP ID NP: 1 1 and fr 30 corresponds to SEP ID NP: 12. 

Please amend the paragraph beginning on page 7, line 28, as follows: 

Figure 19 (A and B) shows the DNA sequence of the 20K-cellulase gene (SEP ID 
NP: 30) . The arrow indicates the predicted signal peptidase processing site. 

Please amend the paragraph beginning on page 8, line 4, as follows: 

Figure 21 (A and B) (A, B and C) shows the DNA sequence of the 50K-cellulase gene 
(SEP ID NP: 32) . The arrow indicates the predicted signal peptidase processing site. 

Please amend the paragraph beginning on page 8, line 9, as follows: 

Figure 23 (A and B) (A, B and C) shows the DNA sequence of the 50K-cellulase B 
gene (SEP ID NO: 34) . The arrow indicates the predicted signal peptidase processing site. 
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Please amend the paragraph beginning on page 8, line 16, as follows: 

Figure 27 shows the DNA sequence of the protein-with-CBD cellulase gene (SEP ID 

NO: 36) in pALK1230. 

Please amend the paragraph beginning on page 18, line 3, as follows: 

A nucleic acid molecule encoding a polypeptide having the enzymatic activity of a 
cellulase, selected from the group consisting of: 

(a) nucleic acid molecules encoding a polypeptide comprising the amino acid 
sequence as depicted in Figure 19 (SEP ID NO: 31) or 21 (SEP ID NO: 33) ; 

(b) nucleic acid molecules encoding a polypeptide comprising the amino acid 
sequence as depicted in Figure 23 (SEP ID NO: 35) or 27 (SEP ID NP: 37) ; 

(c) nucleic acid molecules comprising the coding sequence of the nucleotide 
sequence as depicted in Figure 19 (SEPIDNP: 30) or 21 (SEPIDNP: 32) ; 

(d) nucleic acid molecules comprising the coding sequence of the nucleotide 
sequence as depicted in Figure 23 (SEP ID NP: 34) or 27 (SEP ID NP: 36) ; 

(e) nucleic acid molecules encoding a polypeptide comprising the amino acid 
sequence encoded by the DNA insert contained in DSM 11024, DSM 11012, DSM 11025 or 
DSM 11014; 

(f) nucleic acid molecules encoding a polypeptide comprising the amino acid 
sequence encoded by the DNA insert contained in DSM 1 1026, DSM 11011, DSM 1 1013 or 
DSM 11027; 

(g) nucleic acid molecules comprising the coding sequence of the DNA insert 
contained in DSM 11024, DSM 11012, DSM 11025 or DSM 11014; 
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(h) nucleic acid molecules comprising the coding sequence of the DNA insert 
contained inDSM 11026, DSM 11011, DSM 11013 or DSM 11027; 

(i) nucleic acid molecules hybridizing to a molecule of any one of (a), (c), (e) or 
(g); and 

(j) nucleic acid molecules the coding sequence of which differs from the coding 
sequence of a nucleic acid molecule of any one of (a) to (i) due to the degeneracy of the 
genetic code; and 

(k) nucleic acid molecules encoding a polypeptide having cellulase activity and 
having an amino acid sequence which shows at least 80% identity to a sequence as depicted 
in Figure 19 (SEP ID NO: 31) . 21 (SEP ID NO: 33) . 23 (SEP ID NO: 35) or 27 (SEP ID 
NO: 37) . 

Please amend the paragraph beginning on page 48, line 16, as follows: 

Amino acid sequences of tryptic peptides derived from 20K-cellulases are shown in 
Figure 17. Sequence #429 corresponds to SEP ED NO: 1; sequence #430 corresponds to 
SEQ ID NO: 2; sequence #431 corresponds to SEP ID NO: 3; sequence #432 corresponds to 
SEP ID NO: 4; sequence #433 corresponds to SEP ID NO: 5; sequence #439 corresponds to 
SEP ID NP: 6; fr 9 corresponds to SEP ID NP: 7; fr 14 corresponds to SEP ID NO: 8; fr 16 
corresponds to SEP ID NP: 9; fr 17 corresponds to SEP ID NP: 10; fr 28 corresponds to 
SEP ID NP: 1 1 and fr 30 corresponds to SEP ID NP: 12. 
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Please amend the paragraph beginning on page 53, line 14, as follows: 

Table IX 

Sequences of peptides isolated from the 50K-cellulase (uncertain residues in lower case) 

#507 (SEP ID NO: 13) VYLLDETEHR 

#509 (SEP ID NO: 14) XXLNPGGAYYGT 

#563 (SEP ID NO: 15) MsEGAECEYDGVCDKDG 

#565 (SEPIDNP: 16) NPYRVXITDYYGNS 

#603 f SEPIDNP: 17) DPTGARSELNPGGAYYGTGYXDAQ 

#605 (SEPIDNP: 18) XXVPDYhQHGVda 

#6 1 0 (SEPIDNP: 19) NEMDIXE ANSRA 

#6 1 1 (SEP ID NP: 20) LPXGMNSALYLSEMDPTGARSELNP 

#612 (SEPIDNP: 21) VEPSPEVTYSNLRXGEIXgXF 

#6 1 9 (SEP ID NP: 22) DGCGWNP YRVvITtD YYnN 

#620 (SEP ID NP: 23) LPCGMXSALY 

#621 (SEPIDNP: 24) ADGCQPRTNYIVLDdLlHPXXQ 

Please amend the paragraph beginning on page 55, line 4, as follows: 

Table X 

Sequences of peptides isolated from the 50K-cellulase B (uncertain residues in lower case) 

#534 (SEP ID NP: 25) vGNPDFYGK 

#535 (SEP ID NP: 26) FGPIGSTY 

#631 (SEP ID NP: 27) LSQYFIQDGeRK 

#632 (SEP ID NP: 28) FTVVSRFEENK 

#636 (SEP ID NP: 29) HEYGTNVGSRFLYLMNGPDK 
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Please amend the paragraph beginning on page 67, line 4, as follows: 

To amplify the 20K-cellulase gene by polymerase chain reaction (PCR), a pair of 

degenerate primers based on the peptide sequences (Figure 17) (SEO ID NOS: 1-12) was 

synthesized. Primer 1 (429-32) (SEQ ID NO: 38) was derived from the amino acids #8-14 of 

the N-terminal peptide #429 (Figure 17) (SEO ID NO: 1) , and primer 2 (fr28-16) (SEO ID 

NO: 39) was designed as the antisense strand for the amino acids #2-8 of the peptide fr28 

(Figure 17) (SEQIDNO: 11) . Additional EcoKl restriction sites were added at the S'-termini 

to facilitate the cloning of the amplified fragment. 

Please amend the paragraph beginning on page 67, line 12, as follows: 
Primer 1 (429-32 KSEO ID NO: 38) 

Please amend the paragraph beginning on page 67, line 17, as follows: 
Primer 2 (fr28-16) (SEO ID NO: 39) 

Please amend the paragraph beginning on page 69, line 7, as follows: 

The insert (594 bp) in pALK549 was found to encode the majority of the 20K- 
cellulase derived peptide (Figure 17) (SEO ID NOS: 1-12) . The PCR amplified DNA (in 
addition to the primers) corresponds to the nucleotides 175-716 in Figure 19 (A and B)(SEQ 
ID NO: 30) . 

Please amend the paragraph beginning on page 70, line 18, as follows: 

The Melanocarpus albomyces DNA in pALK1221 was sequenced as described in 
Example 19. The DNA sequence encoding the Melanocarpus albomyces 20K-cellulase is 
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shown in Figure 19 (A and B)(SEO ID NO: 30) . The sequence is 936 bp in length, and has 
an open reading frame (ORF) coding for 235 amino acids; the gene has two introns. The 
putative signal peptide processing site is after alanine-21, and the N- terminus of the mature 
protein begins at alanine-22, as suggested by the peptide sequencing results (Figure 17, 
peptide #429) (SEO ID NO: 1) . The ORF predicts a protein with a molecular weight of 25.0 
kDa for the full-length preprotein, and 22.9 kDa for the mature protein. This is in good 
agreement with the results obtained from the protein purification work (Example 10). These 
results also verify that the about 35 kDa protein detected previously with the 20K-cellulase 
antiserum (Example 10) is a different gene product than the 20K-cellulase. 

Please amend the paragraph beginning on page 71, line 14, as follows: 

The peptides derived from the 50K-cellulase (Table IX) shared some homology 
towards Humicola grisea endonuclease I (DDBJ:D63516). To amplify the 50 K-cellulase 
gene by polymerase chain reaction (PCR) a pair of degenerate primers based on the peptide 
sequences (Table IX) (SEQ ID NOS: 13-24) was synthesized. Primer 1 (507-128) (SEO ID 
NO: 40) was derived from the amino acids #5-10 of the peptide #507 (Table DO fSEO ID 
NO: 13) , and primer 2 (509-rev) (SEO ID NO: 41) was designed as the antisense strand for 
the amino acids #4-9 of the peptide 509 (Table IX) (SEO ID NO: 14) . The order of the two 
peptides in the protein-and the corresponding sense-antisense nature of the primers- was 
deduced from comparison with the Humicola grisea endonuclease I. 



Please amend the paragraph beginning on page 71, line 23, as follows: 
Primer 1 (507-128) (SEO ID NO: 40) 
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Please amend the paragraph beginning on page 72, line 1, as follows: 
Primer 2 (509-rev) (SEP ID NO: 41) 



Please amend the paragraph beginning on page 73, line 10, as follows: 

The insert (161 bp) in pALK1064 was sequenced as described in Example 19, and 
was found to contain an PRF, which predicted a peptide homologous to Humicola grisea 
endoglucanase I (DDBJ:D63516). The ORF also encoded the peptide #612 (Table DO (SEO 
ID NO: 21) from the purified 50K-cellulase. The PCR amplified DNA (in addition to the 
primers) corresponds to the nucleotides 404-530 in Figure 21 (SEP ID NO: 32) . 



Please amend the paragraph beginning on page 74, line 15, as follows: 

The DNA encoding the Melanocarpus albomyces 50K-cellulase is shown in Figure 21 
(A and B) (A, B and C)(SEP ID NO: 32) . The sequence reveals an ORF of about 1363 bp in 
length, interrupted by one intron. The ORF codes for 428 amino acids. The predicted protein 
has a molecular weight of 46.8 kDa and after signal peptide cleavage of 44.8 kDa. All the 
peptides in Table IX (SEP ID NOS: 13-24) are found in the predicted protein sequence 
(Figure 2) (Figure 21XSEP ID NP: 33.) . although some amino acids identified with 
uncertainty during the peptide sequencing proved to be incorrect. The protein shows 
homology to Humicola grisea endoglucanase I (DDBJ:D63516). 



Please amend the paragraph beginning on page 74, line 26, as follows: 

The peptides derived from the 50K-cellulase B (Table X) (SEP ID NPS: 25-29) 
shared some homology towards Humicola grisea cellobiohydrolase I (DDBJ:D63515). To 
amplify the 50K-cellulase B gene by polymerase chain reaction (PCR) a pair of degenerate 
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primers based on the peptide sequences (Table X) (SEQ ID NOS: 25-29) was synthesized. 
Primer 1 (636) (SEO ID NO: 42) was derived from the amino acids #1-5 of the peptide #636 
(Table X) (SEQ ID NO: 29) (the first amino acids was guessed to be lysine, because the 
peptide was isolated after digestion with a protease cleaving after lysines), and primer 2 (534- 
rev) (SEO ID NO: 43) was desinged as the antisense strand for the amino acids #3-8 of the 
peptide #534 (Table X) (SEO ID NO: 25) . The order of the two peptides in the protein-and 
the corresponding sense-antisense nature of the primers-was deduced from comparison with 
the Humicola grisea cellobiohydrolase I. 

Please amend the paragraph beginning on page 75, line 8, as follows: 
Primer 1 (636 KSEO ID NO: 42) 

Please amend the paragraph beginning on page 75, line 1 1, as follows: 
Primer 2 (534-rev) (SEO ID NO: 43) 

Please amend the paragraph beginning on page 76, line 21, as follows: 

The insert in pALK1224 was sequenced as described in Example 19, and was found 
to contain an ORF encoding the whole peptide #636 (SEP ID NO: 29) from 50K-cellulase B 
(Table X). The ORF predicted a peptide homologous to Humicola grisea cellobiohydrolase I 
(DDBJ:D63515). The PCR amplified DNA (in addition to the primers) corresponds to the 
nucleotides 371-1023 in Figure 23 (A, B and C)(SEO ED NO: 34) . 
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Please amend the paragraph beginning on page 77, line 29, as follows: 

Part of the inserts in pALK1229 and pALK1236 were sequenced as described in 

Example 19. The DNA encoding the Melanocarpus albomyces 50K-cellulase B is shown in 

Fieuro 23 (A and B) (Figure 23A, B and C)(SEO ID NO: 34) . The sequence reveals an ORF 

of 1734 bp in length interrupted by five introns. The ORF codes for 452 amino acids. The 

predicted protein has a molecular weight of 49.9 kDa and after signal peptide cleavage of 

47.6 kDa. All the peptides in Table X (SEP ID NOS: 25-29) are found in the predicted 

protein sequence (Figur e 23A and B X Figure 23 A, B and C)(SEQ ID NO: 2>5\ although some 

amino acids identified with uncertainty during the peptide sequencing proved to be incorrect. 

The predicted protein shows homology to Humicola grisea cellobiohydrolase I 

(DDBJ:D63515) and other cellobiohydrolases. However, 50K-cellulase B has the surprising 

feature that it does not harbor the cellulose binding domain (CBD) and its linker, which is 

characteristic to Humicola grisea cellobiohydrolase I and many other cellobiohydrolases. 

Please amend the paragraph beginning on page 79, line 18, as follows: 

Part of the insert in pALK1230 was sequenced as described in Example 19. The 
DNA appears not to encode the 20K-cellulase, but codes for a protein homologous to several 
cellulases, particularly at the cellulose binding domain (CBD) area. Thus the gene product 
very likely has high affinity towards cellulosic material, and therefor this gene product was 
designated as protein-with-CBC. The sequence is shown in Figure 27 (SEP ID NO: 36) . 
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Please amend the paragraph beginning on page 79, line 24, as follows: 

PCR reactions with the primers 636 (SEP ID NO: 42) and 534-rev (SEP ID NO: 43) 

(Example 23) were performed with the DNA from the 19 lambda clones as templates. One 

lambda clone, lambda-3, gave a band about 700 bp in size, similar to that in Example 23 

when ALK04237 chromosomal DNA was used as a template. This clone had originally been 

picked by the Trichoderma cbhl probe. The lambda DNA was digested with several 

restriction endonucleases, and hybridized with the 50K-cellulase B specific probe. The clone 

showed similar restriction enzyme pattern as the 3 clones in Example 24. It is concluded that 

lambda -3 also carries the 50K-cellulase B gene. 

Please amend the paragraph beginning on page 85, line 3, as follows: 

*71 reesei cbhl (cellobiohydrolase 1) promoter: The promoter is from Trichoderma 
reesei VTT-D-80133 (Teeri et al. (1983) The molecular cloning of the major cellulase gene 
from Trichoderma reesei. Bio/Technology 1: 696.). The 2.2 kb EcoRl-Sacll fragment 
(Karhunen et al. (1993) High frequency one-step gene replacement in Trichoderma reesei. I. 
Endoglucanase I overproduction. Mol. Gen. Genet. 241:515) was used in the construct. The 
sequence of the promoter area preceding the ATG was published by Shoemaker et al. (1983) 
Molecular cloning of exo-cellobiohydrolase from Trichoderma reesei strain L27. 
Bio/Technology 1.691.). The last 15 nucleotides of the T. reesei L27 cbhl promoter (the 
Sacll site is underlined) are CCGCGG ACTGGCATC (SEP ID NO: 44) (Shoemaker et al. 
1983). The cbhl promoter from the T reesei strain VTT-D-80133 has been sequenced at 
Alko Research Laboratories, and one nucleotide difference in the DNA sequence has been 
noticed within the above mentioned region. In the T. reesei strain VTT-D-80133 the 
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sequence preceding the ATG is CCGCGG ACTG/C/GCATC (SEP ID NO: 45) (the SacU site 
is underlined, the additional cytosine in the DNA sequence is between the slashes). 

Please amend the paragraph beginning on page 85, line 28, as follows: 

*Melanocarpus albomyces 20K-cellulase gene: The nucleotide sequence and deduced 
amino acid sequence of the 20K-cellulase gene encoding a 20 kDa cellulase is presented in 
Example 20 (Figure 19 KSEO ID NOS: 30-31) . A 0.9 kb fragment beginning from ATG- 
codon was used in both plasmids. 



